FIELD OF THE INVENTION



The present invention relates to a method for coding an input digital video sequence
corresponding to a color image sequence comprising a luminance component with luminance
values, and having a spatial representation, said method comprising the following steps:

- a transformation step, provided for transforming said video sequence from the
original spatial representation domain into less representation data comprising
transformed luminance values;

- a quantization step, provided for performing a quantization on the representation
data thus obtained, so as to obtain a reduced set of data.

The invention also relates to an encoder implementing said method.
Such a method may be used in, for example, a video communication system. 

BACKGROUND OF THE INVENTION 

A video communication system, for example a television communication system,
typically comprises an encoder, a transmission medium and a decoder. Such a system
receives an input digital video sequence corresponding to an original color image sequence,
encodes said sequence via the encoder, transmits the encoded sequence (also called a bit
stream) via the transmission medium, then decodes the transmitted sequence via the
decoder, resulting in an output digital video sequence.

The input digital video sequence has an associated spatial representation. In classical
video approaches, the spatial representation comprises 3 different components: luminance Y,
chrominance U and chrominance V. The luminance component is represented by different
gray levels, in general 256 gray levels.

In order to transmit only the necessary information of the digital video sequence,
the encoder reduces the spatial representation into less representation data and then
performs a quantization of this reduced representation data.

In order to improve the rate/distortion ratio, that is to say the bit rate used for
encoding versus the distortion perceived in the decoded image sequence relative to the
original image sequence, several quantization solutions have already been proposed in the
prior art.

One of them is described in the document referenced "H.G. Mussmann, P. Pirsch
and H-J Grallert, Advances in picture coding", in the IEEE proceedings, vol. 73, N°.4,
523-548, April 1985. The solution of the prior art is based on an activity measure using
activity functions. A typical example of an activity function is to compute the maximum
difference between neighboring pixels within an area of an image sequence. If the maximum
is lower than a threshold value, it means that the luminance values within this area are
homogeneous, and this area is then considered as having no activity. By means of more
complex activity functions, an image sequence can be divided into several segments, on
which different quantizations are performed. In this case, an adaptive quantizer can be
realized by a set of separate sub-quantizers, one for each segment, with which some
specific activity values are associated.
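
The typical activity function mentioned above can be sketched as follows. This is only
an illustration under stated assumptions: the names activity and has_activity, the use of
horizontal and vertical neighbors only, and the threshold value of 5 gray levels are
hypothetical and are not taken from the cited document.

```python
def activity(area):
    """Maximum absolute difference between horizontally or vertically
    adjacent pixels of a rectangular area (list of rows of gray levels)."""
    rows, cols = len(area), len(area[0])
    best = 0
    for r in range(rows):
        for c in range(cols):
            if c + 1 < cols:  # right-hand neighbor
                best = max(best, abs(area[r][c] - area[r][c + 1]))
            if r + 1 < rows:  # neighbor below
                best = max(best, abs(area[r][c] - area[r + 1][c]))
    return best


def has_activity(area, threshold=5):
    """An area whose maximum difference is lower than the threshold is
    considered as having no activity (threshold value is an assumption)."""
    return activity(area) >= threshold


flat_area = [[100, 101], [100, 102]]
edge_area = [[100, 180], [100, 180]]
print(has_activity(flat_area), has_activity(edge_area))
```

An adaptive quantizer of the kind described would then select one sub-quantizer per
segment according to such activity values.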

All these quantization solutions minimize the distortion of an image on average, over
all original values of said image. Thus, they propose to have few or no reproduction values
in locations at which the probability of occurrence of gray levels, for example, within the
video signal is negligible, whereas in locations at which the occurrence of gray levels is
highly probable, more reproduction points need to be specified.

Given that the aim of image coding is to reconstruct an input image with the best
possible visual quality, one major drawback of these solutions of the prior art is that they
result in only an approximate fit to the perceptual response of the human eye.



SUMMARY OF THE INVENTION

Accordingly, it is an object of the invention to provide a method and an encoder for
coding an input digital video sequence corresponding to a color image sequence comprising
a luminance component with luminance values, and having a spatial representation, as
defined in the preamble part of claim 1, which improve the visual quality of the
reconstructed input digital video sequence with a good rate/distortion ratio.



To this end, the quantization step of the method performs a quantization of the
luminance component in an adaptive way according to a visible range of transformed
luminance values of said luminance component in order to obtain said reduced set of data.
In addition, the quantization means within the encoder are adapted for performing a
quantization of the luminance component in an adaptive way according to a visible range of
transformed luminance values of said luminance component in order to obtain said reduced
set of data.



As will be seen in detail further on, the invention is based on the fact that, under
standard viewing conditions, human eyes cannot distinguish some transformed luminance
values in certain ranges. Therefore, with this principle, the quantization step is
adapted according to the perceptual properties of the human eye, and more particularly to
said visible range.

BRIEF DESCRIPTION OF THE DRAWINGS 
Additional objects, features and advantages of the invention will become apparent
upon reading the following detailed description and upon reference to the accompanying 
drawings in which the Figure illustrates how a quantization can be performed on 
transformed luminance values within or outside a visible range by the encoder according to 
the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

In the following description, functions or constructions well known to the person
skilled in the art are not described in detail, since they would obscure the invention in
unnecessary detail.

The present invention relates to a method of coding an input digital video sequence 
corresponding to an original color image sequence comprising a luminance component with 
luminance values, said method being used in particular in an encoder within a video 
communication system. Said system receives some digital video sequences. 

In order to transmit the input video sequences efficiently through a transmission
medium, said encoder applies an encoding. The encoded sequences, known as a bit stream,
are sent to a decoder, which decodes and reconstructs the original video sequences.

It is to be noted that the spatial representation data is often a YUV luminance and 
chrominance representation well known by the person skilled in the art, with the luminance 
component being represented by 256 gray levels. 

Such an encoder comprises:

- transformation means for transforming said video sequence from an original
spatial representation domain into less representation data comprising
transformed luminance values;

- quantization means for performing a quantization on the representation data thus
obtained, so as to obtain a reduced set of data; and

- encoding means for coding said reduced set of data.

The encoding of an input digital video sequence is done as follows.

In a first step 1), the original spatial representation data, i.e. the YUV luminance and
chrominance representation, is transformed into less representation data, for example in a
frequency domain by a DCT transform or by a mesh method well known to the person skilled
in the art. This less representation data leads to transformed luminance and chrominance
values.



More particularly for the luminance component, this transforms a set of data of
256 gray levels coded on 8 bits into corresponding DC and AC coefficients if, for example, a
DCT transform is used, said DCT transform being applied to blocks of an image sequence.

The DC coefficient of a block is the mean of the luminance values of said block, up to
a normalization factor that depends on the DCT convention used. Hence, this DC coefficient
represents a transformed luminance value. For transforms other than the DCT, the same
parallel can be drawn.
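
The relation between the DC coefficient and the block mean can be checked with the short
sketch below. It assumes the orthonormal 2-D DCT-II convention, for which the DC coefficient
of an n x n block equals n times the mean of the block; other conventions use a different
factor. The function names are hypothetical and serve only as illustration.

```python
import math


def dct2_coefficient(block, u, v):
    """Coefficient (u, v) of the orthonormal 2-D DCT-II of a square block."""
    n = len(block)
    cu = math.sqrt(1.0 / n) if u == 0 else math.sqrt(2.0 / n)
    cv = math.sqrt(1.0 / n) if v == 0 else math.sqrt(2.0 / n)
    total = 0.0
    for x in range(n):
        for y in range(n):
            total += (block[x][y]
                      * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                      * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
    return cu * cv * total


def block_mean(block):
    """Mean of the luminance values of a square block."""
    n = len(block)
    return sum(sum(row) for row in block) / (n * n)


def transformed_luminance(block):
    """DC coefficient rescaled to gray-level units (assumed convention)."""
    return dct2_coefficient(block, 0, 0) / len(block)


block = [[90 + x + y for y in range(8)] for x in range(8)]
print(block_mean(block), transformed_luminance(block))
```

With this rescaling, the transformed luminance value of a block can be compared directly
with gray-level bounds such as those of the visible range.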

Perceptual studies have already shown that, under standard viewing conditions,
human eyes cannot distinguish small luminance variations (from 1 to 5 gray levels).

Moreover, perceptual tests performed by the applicant show that, for a luminance 
component including 256 gray levels (from 0 to 255 for example), human eyes are more 
sensitive to luminance changes inside the luminance range [70 ; 130] than in the range [0 ; 
70] or in the range [130 ; 255]. The first range is called the visible range. 

The luminance values and the transformed luminance values that can be perceived
correctly by human eyes are called respectively relevant values and relevant transformed
values, whereas the others are called respectively non-relevant values and non-relevant
transformed values.
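
As an illustration, the distinction between relevant and non-relevant values can be
sketched as a simple membership test on the visible range [70 ; 130]. The sketch assumes
that "relevant" means "inside the visible range" and that the bounds 70 and 130 belong to
that range; the names VISIBLE_RANGE, is_relevant and split_relevant are hypothetical.

```python
VISIBLE_RANGE = (70, 130)  # bounds taken from the text, assumed inclusive


def is_relevant(value, visible_range=VISIBLE_RANGE):
    """True if a (transformed) luminance value lies in the visible range."""
    low, high = visible_range
    return low <= value <= high


def split_relevant(values, visible_range=VISIBLE_RANGE):
    """Separate values into relevant and non-relevant ones, keeping order."""
    relevant = [v for v in values if is_relevant(v, visible_range)]
    non_relevant = [v for v in values if not is_relevant(v, visible_range)]
    return relevant, non_relevant


print(split_relevant([0, 70, 100, 130, 131, 255]))
```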

Therefore, in a second step 2), a quantization is performed on the reduced set of
data; more particularly, a quantization is performed on the transformed luminance values of
the luminance component, in accordance with the perceptual properties described above.

According to a first non-limitative embodiment, the quantization step performs a
quantization on the luminance component by calculating the probability of appearance of
transformed luminance values within the video sequence, as previously mentioned for the
prior art, but a heavier probability weight is first applied to the transformed luminance
values that are in the visible range. Thus, the transformed luminance values in the visible
range are taken into account in a way that is more adequate for the human eye than if only
a common probability calculation were applied. At the end, the less representation data
is transformed into a reduced set of data according to said probability of appearance.
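
The weighting of this first embodiment can be sketched as below. The text does not give
the value of the weight nor the way the weighted probabilities are turned into
reproduction points, so the weight of 4.0 and the function name weighted_probabilities are
assumptions made only for illustration, and the sketch stops at the weighted probabilities.

```python
from collections import Counter

VISIBLE_LOW, VISIBLE_HIGH = 70, 130  # visible range from the text


def weighted_probabilities(values, visible_weight=4.0):
    """Probability of appearance of each transformed luminance value, with a
    heavier weight (assumed value) applied to values in the visible range."""
    counts = Counter(values)
    weighted = {}
    for value, count in counts.items():
        in_visible_range = VISIBLE_LOW <= value <= VISIBLE_HIGH
        weighted[value] = count * (visible_weight if in_visible_range else 1.0)
    total = sum(weighted.values())
    # Normalize so that the weighted probabilities sum to 1.
    return {value: w / total for value, w in sorted(weighted.items())}


# Two values that appear equally often; only the second is in the visible range.
samples = [20] * 10 + [100] * 10
print(weighted_probabilities(samples))
```

A quantizer designed from these probabilities would place more reproduction points where
the weighted probability is high, that is, preferentially in the visible range.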

According to a second preferred embodiment, the quantization step performs a 
quantization on the luminance component by applying fine quantization points for the 
transformed luminance values in the visible range, whereas outside the range, coarse 
quantization points are used for the transformed luminance values. 

In the example illustrated in the Figure, there are N transformed luminance values 
for the luminance component and M points are used for quantization. 



If the visible range is [a, p], K quantization points K0 to K8 will be used to perform
the quantization of the transformed luminance values in this range. Either one quantization
point will be attributed to one transformed luminance value or, for example, one
quantization point will be attributed to a very small set of transformed luminance values,
for example to 2 transformed luminance values.

These K points may or may not have exactly the same values as the corresponding
transformed luminance values; in the former case, the dynamic range of the luminance
component is kept unchanged. In the example, a = 70 and p = 130.

Outside the visible range, i.e. in the range [0, a] and in the range [p, N-1], L
quantization points are used to perform the quantization of the transformed luminance
values by interval. For example, if the transformed luminance values are from 0 to 15, one
quantization point L0 will be attributed to this interval. From 15 to 30, a second
quantization point L1 will be attributed, etc. Hence, outside the visible range, the
quantization is very coarse. Although the non-relevant transformed values of luminance are
degraded, the human eye will not see any difference.
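
The second embodiment can be sketched as a quantizer with a fine step inside the visible
range and a coarse step outside it. The sketch below uses a = 70, p = 130 and the interval
width of 15 from the example; the choice of the interval midpoint as reproduction value,
the assignment of the bounds 70 and 130 to the visible range, and the function name
quantize are assumptions made for illustration.

```python
def quantize(value, low=70, high=130, fine_step=1, coarse_step=15, max_value=255):
    """Map a transformed luminance value to its reproduction value:
    fine cells inside [low, high], coarse cells outside."""
    if not 0 <= value <= max_value:
        raise ValueError("value outside the luminance dynamic")
    if low <= value <= high:            # visible range: fine quantization
        start, end, step = low, high, fine_step
    elif value < low:                   # below the visible range: coarse
        start, end, step = 0, low - 1, coarse_step
    else:                               # above the visible range: coarse
        start, end, step = high + 1, max_value, coarse_step
    index = (value - start) // step
    cell_low = start + index * step
    cell_high = min(cell_low + step - 1, end)
    return (cell_low + cell_high) // 2  # midpoint of the cell (assumption)


points = sorted({quantize(v) for v in range(256)})
print(len(points), quantize(10), quantize(100), quantize(200))
```

With these assumed parameters, the 256 gray levels are represented by 75 quantization
points: 61 in the visible range and 14 outside it.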

Thus, outside the visible range, a single quantization point is attributed to a large
cluster of transformed luminance values, whereas in the visible range a quantization point
is attributed to one transformed luminance value or to a small cluster of such values.
A cluster in the visible range comprises far fewer transformed luminance values than a
cluster outside the visible range.

Thus, the quantization of the luminance component has been done in an adaptive
way, because it was not uniform for all the transformed luminance values: a fine
quantization has been performed for a certain range of luminance values, and a
coarse quantization has been performed for the other ranges of luminance values.

In a last step 3), the reduced set of data obtained by the quantization is
coded, for example by variable run-length coding well known to the person skilled in the
art, which consists in associating symbols with sequences of values on which a quantization
has been performed.
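
The principle of associating symbols with sequences of quantized values can be illustrated
with a plain run-length coder. This is a simplified sketch, not the variable run-length
coding of any particular standard: it only groups identical consecutive values into
(value, run length) pairs and does not assign variable-length code words. The function
names are hypothetical.

```python
from itertools import groupby


def run_length_encode(values):
    """Replace each run of identical quantized values by (value, run length)."""
    return [(value, len(list(run))) for value, run in groupby(values)]


def run_length_decode(pairs):
    """Expand (value, run length) pairs back into the sequence of values."""
    return [value for value, count in pairs for _ in range(count)]


quantized = [7, 7, 7, 100, 101, 101, 253]
encoded = run_length_encode(quantized)
print(encoded)
```

Coarse quantization outside the visible range tends to produce longer runs of identical
values, which such a coder represents with fewer symbols.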

At the decoder side, the decoding is done to reconstruct the original image, taking
into account the quantization points as described previously. The human eye will not see
much distortion between the output image obtained and the original image.

Thus, one advantage of the present invention is to improve the rate/distortion ratio by
encoding more information with the same bit budget as the prior art, or less information
with far fewer bits, but without losing any quality in the encoding. Indeed, as a fine
quantization is performed on all the relevant transformed values of luminance, the quality of
the image is not lowered. Moreover, the new representation of the image has been chosen such
that the reconstructed video signal matches perfectly the visual capacities of a human
observer.

It is to be understood that the present invention is not limited to the aforementioned
embodiments, and variations and modifications may be made without departing from the
spirit and scope of the invention as defined in the appended claims. In this respect, the
following closing remarks are made.

It is to be noted that the quantization step described above according to the
invention can also be applied directly to the luminance values of the spatial representation.
In practice, however, since video applications always involve compression, it will rarely
be applied directly to the luminance values, but rather to the transformed luminance values
only.

It is to be understood that the present invention is not limited to the aforementioned
video application. It can be used within any application using a system for coding a digital
video sequence where the ultimate consumer is the human eye, such as applications
including digital movies, HDTV, and transmission and visualization of scientific imagery.
Image codes have to be designed to match the visual capabilities of the human observer.

It is to be understood that the method according to the present invention is not
limited to the aforementioned implementation.

There are numerous ways of implementing functions of the method according to the
invention by means of items of hardware or software, or both, provided that a single item of
hardware or software can carry out several functions. It does not exclude that an assembly
of items of hardware or software or both carry out a function, thus forming a single function
without modifying the method for coding the video signal in accordance with the invention.
Said hardware or software items can be implemented in several manners, such as by
means of wired electronic circuits or by means of an integrated circuit that is suitably
programmed, respectively. The integrated circuit can be contained in a computer or in an
encoder. In the second case, the encoder comprises transformation means and quantization
means, as described previously, said means being hardware or software items as stated
above.

The integrated circuit comprises a set of instructions. Thus, said set of instructions
contained, for example, in a computer programming memory or in an encoder memory may
cause the computer or the encoder to carry out the different steps of the coding method.

The set of instructions may be loaded into the programming memory by reading a
data carrier such as, for example, a disk. A service provider can also make the set of
instructions available via a communication network such as, for example, the Internet.

Any reference sign in the following claims should not be construed as limiting the
claim. It will be obvious that the use of the verb "to comprise" and its conjugations does not
exclude the presence of any other steps or elements besides those defined in any claim. The
article "a" or "an" preceding an element or step does not exclude the presence of a plurality
of such elements or steps.
CLAIMS

1. A method for coding an input digital video sequence corresponding to a color image 
sequence comprising a luminance component with luminance values, and having a 
spatial representation, said method comprising the following steps:

- a transformation step, provided for transforming said video sequence from the 
original spatial representation domain into less representation data comprising 
transformed luminance values ; 

- a quantization step, provided for performing a quantization on the representation
data thus obtained, so as to obtain a reduced set of data; characterized in that said
quantization step performs a quantization of the luminance component in an adaptive way
according to a visible range of transformed luminance values of said luminance
component in order to obtain said reduced set of data.

2. A method for coding an input digital video sequence as claimed in claim 1, characterized
in that the quantization step is performed by:

- applying a heavy weight to the transformed luminance values in the visible range;
and

- computing the probability of appearance of the transformed luminance values within
the luminance component; and

- transforming the representation data into said reduced set of data according to said
probability of appearance.

3. A method for coding an input digital video sequence as claimed in claim 1, characterized
in that the quantization step is performed by:

- using coarse quantization points for the transformed luminance values outside the
visible range; and

- using fine quantization points for the transformed luminance values in the visible
range.

4. A computer program product for an encoder, comprising a set of instructions, which, 
when loaded into said encoder, causes the encoder to carry out the method claimed in
claims 1 to 3. 



5. A computer program product for a computer, comprising a set of instructions, which,
when loaded into said computer, causes the computer to carry out the method claimed
in claims 1 to 3.

6. An encoder for coding an input digital video signal corresponding to a color image
sequence comprising a luminance component with luminance values, said signal having
a spatial representation, said encoder comprising:

- transformation means for transforming said video sequence from an original spatial
representation domain into less representation data comprising transformed
luminance values;

- quantization means for performing a quantization on the representation data thus
obtained, so as to obtain a reduced set of data, characterized in that said quantization
means are adapted for performing a quantization of the luminance component in an
adaptive way according to a visible range of transformed luminance values of said
luminance component in order to obtain said reduced set of data.



7. A video communication system, which is able to receive an input digital video signal,
said signal being coded by the encoder defined in claim 6.
Method and encoder for coding a digital video signal
ABSTRACT 

The present invention relates to a method and an encoder for coding an input digital video 
signal comprising a luminance component with luminance values. The method comprises the 
steps of: 

- a transformation step, provided for transforming said video sequence from the 
original spatial representation into less representation data comprising transformed 
luminance values ; 

- a quantization step, provided for performing a quantization on the representation
data thus obtained, so as to obtain a reduced set of data.

The invention is characterized in that the quantization step performs a quantization of the
luminance component in an adaptive way according to a visible range of transformed
luminance values of the luminance component.



Use: encoder in a video communication system 

Reference: Fig. 